Crowdsourced Semantic Matching of Multi-Label Annotations
نویسندگان
چکیده
Most multi-label domains lack an authoritative taxonomy. Therefore, different taxonomies are commonly used in the same domain, which results in complications. Although this situation occurs frequently, there has been little study of it using a principled statistical approach. Given that (1) different taxonomies used in the same domain are generally founded on the same latent semantic space, where each possible label set in a taxonomy denotes a single semantic concept, and that (2) crowdsourcing is beneficial in identifying relationships between semantic concepts and instances at low cost, we proposed a novel probabilistic cascaded method for establishing a semantic matching function in a crowdsourcing setting that maps label sets in one (source) taxonomy to label sets in another (target) taxonomy in terms of the semantic distances between them. The established function can be used to detect the associated label set in the target taxonomy for an instance directly from its associated label set in the source taxonomy without any extra effort. Experimental results on real-world data (emotion annotations for narrative sentences) demonstrated that the proposed method can robustly establish semantic matching functions exhibiting satisfactory performance from a limited number of crowdsourced annotations.
منابع مشابه
Separate or joint? Estimation of multiple labels from crowdsourced annotations
Artificial intelligence techniques aimed at more naturally simulating human comprehension fit the paradigm of multi-label classification. Generally, an enormous amount of high-quality multi-label data is needed to form a multi-label classifier. The creation of such datasets is usually expensive and timeconsuming. A lower cost way to obtain multi-label datasets for use with such comprehension–si...
متن کاملImproving Classification by Improving Labelling: Introducing Probabilistic Multi-Label Object Interaction Recognition
This work deviates from easy-to-define class boundaries for object interactions. For the task of object interaction recognition, often captured using an egocentric view, we show that semantic ambiguities in verbs and recognising sub-interactions along with concurrent interactions result in legitimate class overlaps (Figure 1). We thus aim to model the mapping between observations and interactio...
متن کاملEfficiently Scaling Up Video Annotation with Crowdsourced Marketplaces
[1] Yuen, J., Russell, B., Liu, C., Torralba, A.: LabelMe video: Building a Video Database with Human Annotations. (2009) [2] Vijayanarasimhan, S., Grauman, K.: Whats It Going to Cost You?: Predicting Effort vs. Informativeness for Multi-Label Image Annotations, CVPR (2009) [3] Sorokin, A., Forsyth, D.: Utility data annotation with amazon mechanical turk. Urbana 51 (2008) 61820 Interactive Vide...
متن کاملEstablishing Relationships between Emotion Taxonomies Using the Vector Space Model
Due to different aspects that emotion-oriented research looks to capture, the emotion taxonomy used often differs among research efforts. Therefore, it is hard to coordinate the research efforts using different emotion taxonomies. On the other hand, due to the multiplicity of “emotion”, emotion annotations more naturally fit the paradigm of multi-label classification since one instance (such as...
متن کاملRobust Online Gesture Recognition with Crowdsourced Annotations
Crowdsourcing is a promising way to reduce the effort of collecting annotations for training gesture recognition systems. Crowdsourced annotations suffer from ”noise” such as mislabeling, or inaccurate identification of start and end time of gesture instances. In this paper we present SegmentedLCSS and WarpingLCSS, two template-matching methods offering robustness when trained with noisy crowds...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015